Raising of the Computational Efficiency in Glottal Closure Instant Determination through Wavelet Transform*
نویسندگان
چکیده
The determination of Glottal Closure Instant (GCI) is an important task in voice signal analysis. At GCI the vocal folds close the glottis for the airstream coming from the lungs. By opening and closing the airstream passage the vocal folds generate quasi periodic impulse source for the vocal tract system. GCI determine the instantaneous voice signal pitches. They are used for speaker identification, speech recognition, voice pathology investigations, etc. GCI is of considerable importance for modelling the vocal tract too. The latter can be assumed a resonant system in free oscillating mode during closed glottis interval (CGI). In this case the vocal tract can be modeled applying linear prediction methods [1]. Some of the new methods for GCI determination use wavelet transform (WT) apparatus. It is well-known that WT are very appropriate for signal abrupt changes detection [2, 3]. Several types of wavelets have been applied for GCI determination, real [4, 5] and complex valuated too [6]. The latter determine the signal transients through transform modulus maxima and equiphase lines. The most commonly used wavelet in this case is the Morlet wavelet [3, 5, 7]. Unfortunately, its application is complicated because of large computational costs. Several algorithms for improving the computational efficiency have been proposed. The most popular among them is known as the algorithm “a trous” [8]. Another approach uses an approximation of a wavelet with close to Morlet wavelet features [9]. Novel results for GCI determination with exponentially modulated (EM) wavelet are reported in the present work. The wavelet has a close to Morlet wavelet form and allows considerable computational costs reduction without loss of GCI determination accuracy. БЪЛГАРСКА АКАДЕМИЯ НА НАУКИТЕ . BULGARIAN ACADEMY OF SCIENCES
منابع مشابه
Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima
This paper deals with singularities characterisation and detection in Electroglottogram (EGG) signal using wavelet transform modulus maxima. These singularities correspond to glottal opening and closure instants (GOIs and GCIs), Wavelets with one and two vanishing moments are applied to EGG signal. We show that wavelet with one vanishing moment is sufficient to detect singularities of EGG signa...
متن کاملAn automatic pitch-marking method using wavelet transform
This paper describes a new automatic pitch-marking method using wavelet transform. This method detects discontinuity in the speech waveform which occurs at the glottal closure instant (GCI). A time domain prosodic modification technique requires an appropriate determination of the synthesis pitch-marks. We evaluated the performance of the newly developed pitchmarking method by using our interna...
متن کاملGlottal flow derivative modeling with the wavelet smoothed excitation
This paper discusses a method for estimating glottal flow derivative model parameters using the wavelet-smoothed excitation. The excitation is first estimated using the Weighted Recursive Least Squares with Variable Forgetting Factor algorithm. The raw excitation is then smoothed by applying a Discrete Wavelet Transform (DWT) using Biorthogonal Quadrature filters, and a thresholding operation d...
متن کاملPhase-Based Methods for Voice Source Analysis
Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximu...
متن کاملRobust glottal closure detection using the wavelet transform
In this work, a time-scale framework for analysis of glottal closure instants is proposed. As glottal closure can be soft or sharp, depending on the type of vocal activity, the analysis method should be able to deal with both wide-band and low-pass signals. Thus, a multi-scale analysis seems well-suited. The analysis is based on a dyadic wavelet filterbank. Then, the amplitude maxima of the wav...
متن کامل